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LACCASE MUTANTS 

FIELD OP THE INVENTION 

BACKGROUND OF THE INVENTION 



Laccase is a polyphenol oxidase (EC 1.10.3.2) which catalyses 
the oxidation of a variety of inorganic and aromatic compounds, 
particularly phenols, with the concomitant reduction of molecular 
oxygen to water. 

Laccase belongs to a family of blue copper-containing 
oxidases which includes ascorbate oxidase and the mammalian 
plasma protein ceruloplasmin . All these enzymes are multi-copper- 
containing proteins . 

Because laccases are able to catalyze the oxidation of- a 
variety of inorganic and aromatic compounds, laccases have been 
suggested in many potential industrial applications such as 
lignin modification, paper strengthening, dye transfer inhibition 
m detergents, phenol polymerization, hair colouring, and waste 
water treatment. 

No three-dimensional structural information has been 
available for a laccase before. 

We have now elucidated the three-dimensional structure of a 
Coprinus cinereus laccase. By having this three-dimensional 
structure we are able to create laccase variants with altered 
properties: increased oxidation potential and/or changed pH 
optimum and/or altered mediator pathway and/or altered O^/OH'- 
pathway . 



30 



35 



BRIEF DISCLOSURE OF THE INVENTION 

The three-dimensional structure of a laccase has now been 
elucidated. On the basis of an analysis of said structure it is 
possible to identify structural parts or specific amino acid 
residues which from structural or functional considerations 
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appear to be important for increased oxidation potential and/or 
changed pH optimum and/or altered mediator pathway and/or altered 
02/0H--pathway of a laccase . 

Furthermore, when comparing the three-dimensional structure 
of the Coprinus laccase structure with known amino acid sequences 
of various laccases, it has been found that some similarities 
exist between the sequences. The present invention is based on 
these findings . 

Accordingly, in a first aspect the invention relates to a 
method of constructing a variant of a parent Coprinus laccase, 
which variant has laccase activity and increased oxidation 
potential and/or changed pH optimum and/or altered mediator 
pathway and/or altered 02/0H--pathway as compared to said parent 
laccase, which method comprises 

i) analysing the three-dimensional structure of the parent 
Coprinus laccase to identify at least one amino acid residue or 
at least one structural part of the Coprinus laccase structure, 
which amino acid residue or structural part is believed to be of 
relevance for altering the oxidation potential and/or altering 
the pH optimum and/or altering the mediator pathway and/or 
altering the Oj/OH" -pathway of the parent Coprinus laccase (as 
evaluated on the basis of structural or functional 
considerations) , 

ii) constructing a Coprinus laccase variant, which as compared to 
the parent Coprinus laccase, has been modified in the amino acid 
residue or structural part identified in i) so as to alter the 
oxidation potential and/or alter the pH optimum and/or alter the 
mediator pathway and/or alter the O^/OH"- pathway, and, optionally, 

lii) testing the resulting Coprinus laccase variant with respect 
to oxidation potential and/or pH optimum and/or mediator pathway 
and/or 02/OH--pathway . 

In a second aspect the present invention relates to a method 
of constructing a variant of a parent Coprinus- like laccase, 
which variant has laccase activity and increased oxidation 
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potential and/or changed pH optimum and/or altered mediator 
pathway and/or altered O^/OH" -pathway as compared to said parent 
laccase, which method comprises 

i) comparing the three-dimensional amino acid structure of the 
Coprinus laccase with an amino acid sequence of a Coprinus- like 
laccase, 

ii) identifying a part of the Coprinus- like laccase amino acid 
sequence which is different from the Coprinus laccase amino acid 
sequence and which from structural or functional considerations 
is contemplated to be responsible for differences in the 
stability of the Coprinus and Coprinus- like laccase, 

iii) modifying the part of the Coprinus-like laccase identified 
in ii) whereby a Coprinus-like laccase variant is obtained, which 
has an increased oxidation potential and/or changed pH optimum 
and/or altered mediator pathway and/or altered O^/OH"- pathway as 
compared to the parent Coprinus-like laccase, and optionally, 

iv) testing the resulting Coprinus-like laccase variant with 
respect to oxidation potential and/or pH optimum and/or mediator 
pathway and/or O^/OH-pathway . 

In still further aspects the invention relates to variants of 
a Coprinus laccase and of Coprinus-like laccases, DNA encoding 
such variants and methods of preparing the variants. Finally, the 
invention relates to the use of the variants for various 
industrial purposes . 

30 DETAILED DISCLOSURE OF THE INVENTION 
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The Coprinus- like laccases 

A number of laccases produced by different fungi are 
homologous on the amino acid level. For instance, when using the 
35 homology percent obtained from UWGCG program using the GAP 
program with the default parameters (penalties: gap weight =3.0, 
length weight=0.1; WISCONSIN PACKAGE Version 8.1-UNIX, August 
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1995, Genetics Computer Group, 575 Science Drive, Madison, 
Wisconsin, USA 53711) the following homology was found: 
Coprinus cinereus laccase comprising the amino acid sequence 
shown in SEQ ID No. 1: 100%; 
5 Polyporus pinsitus (I) laccase comprising the amino acid sequence 
shown in SEQ ID No. 2: 74.4%; 

Polyporus pinsitus (II) laccase comprising the amino acid 
sequence shown in SEQ ID No. 3: 73.8%; 

Phlehia ra.dia.ta. laccase comprising the amino acid sequence shown 
10 in SEQ ID No. 4: 69.9% ; 
i==J ^ Rhizoctonia solani (I) laccase comprising the amino acid sequence 
SA shown in SEQ ID No. 5: 64.8%; 

m Rhizoctonia solani (II) laccase comprising the amino acid 
hi sequence shown in SEQ ID No . 6: 63.0%; 

■;:;=15 Rhizoctonia solani (III) laccase comprising the amino acid 

- sequence shown in SEQ ID No . 7: 61.0%; 
hi Rhizoctonia solani (IV) laccase comprising the amino acid 
O sequence shown in SEQ ID No . 8: 59.7%; 

Scytalidium thermophilum laccase comprising the amino acid 
ao sequence shown in SEQ ID No. 9: 57.4%; 

Myceliophthora thermophila laccase comprising the amino acid 

sequence shown in SEQ ID No. 10: 56.5%. 

Because of the homology found between the above mentioned 
25 laccases, they are considered to belong to the same class of 
laccases, namely the class of " Coprinus- like laccases". 

Accordingly, in the present context, the term " Coprirzus- like 
laccase" is intended to indicate a laccase which, on the amino 
acid level, displays a homology of at least 50% and less than 
30 10 0% to the Coprinus cinereus laccase SEQ ID NO 1, or at least 
55% and less than 100% to the Coprinus cinereus laccase SEQ ID NO 
1, or at least 60% and less than 100% to the Coprinus cinereus 
laccase SEQ ID NO 1, or at least 65% and less than 100% to the 
Coprinus cinereus laccase SEQ ID NO 1, or at least 70% and less 
35 than 10 0% to the Coprinus cinereus laccase SEQ ID NO 1, or at 
least 75% and less than 100% to the Coprinus cinereus laccase SEQ 
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ID NO 1, or at least 80% and less than 100% to the Coprinus 
cinereus laccase SEQ ID NO 1, or at least 85% and less than 100% 
to the Coprinus cinereus laccase SEQ ID NO 1, or at least 90% and 
less than 10 0% to the Coprinus cinereus laccase SEQ ID NO 1, or 
5 at least 95% and less than 100% to the Coprinus cinereus laccase 
SEQ ID NO 1. 

In the present context, "derived from" is intended not only 
to indicate a laccase produced or producible by a strain of the 
organism in question, but also a laccase encoded, by a DNA 
10 sequence isolated from such strain and produced in a host or- 
ganism containing said DNA sequence. Finally, the term is 
intended to indicate a laccase which is encoded by a DNA sequence 
of synthetic and/or cDNA origin and which has the identifying, 
characteristics of the laccase in question . 

15 

The three-dime nsional Covrinus laccase structure 

The Coprinus laccase which was used to elucidate the three- 
dimensional structure forming the basis for the present invention 
consists of the 53 9 amino acids derived from Coprinus cinereus 
20 laccase IFO 83 71 as disclosed in sequence ID No. 1. 

The obtained three-dimensional structure is believed to be 
representative for the structure of any Coprinus-like laccase. 

The structure of the laccase was solved in accordance with 
the principle jfor X-ray crystallographic methods given in "X-Ray 
Structure Determination", Stout, G.K. and Jensen, L.H. , John 
Wiley &- Sons, inc. NY, 1989. The structural coordinates for the 
solved crystal structure of the laccase at 2.2 A resolution using 
the isomorphous replacement method are given in a standard PDB 
format (Brookhaven Protein Data Base) in Appendix 1. It is to be 
understood that Appendix 1 forms part of the present application. 

In Appendix 1 the amino acid residues of the enzyme are 
identified by three-letter amino acid code (capitalized letters) . 

The laccase structure is made up of three plastocyanin- like 
domains. These three domains all have a similar beta-barrel fold. 

3 copper atoms were observed in the three-dimensional 
structure : 

The so-called type 1 copper ion is coordinated by two 
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histidines and one cysteine. 

The so-called type 2 copper of the trinuclear centre is 
missing in the structure disclosed in the present application. 

The so-called type 3 copper consists of two type 3 copper 
atoms (pair of copper atoms) bound to a total of 6 histidine 
ligands . 

When comparing the amino acid sequence of the crystallized 
three-dimensional structure with Coprinus cinereus amino acid 
sequence ID No. 1 the following four differences are observed: 
18 amino acids are missing from the N-terminal of the 
crystallized protein; 

17 amino acids are missing from the C-terminal of the 
crystallized protein; 

Q19 in sequence ID--No. 1 is an Al in the crystallized protein; 
and 

Q243 in sequence ID No . 1 is an E225 in the crystallized protein. 

Generality of structure 

Because of the homology between the Coprinus laccase and the 
various Coprinus- like laccases, the solved structure defined by 
the coordinates of Appendix 1 is believed to be representative 
for the structure of all Coprinus-like laccases. A model 

structure of Coprinus- like laccases may be built on the basis of 
the coordinates given in Appendix 1 adapted to the laccase in 
question by usa of an alignment between the respective amino acid 
sequences . 

The above identified structurally characteristic parts of the 
Coprinus laccase structure may be identified in other Coprinus- 
like laccases on the basis of a model (or solved) structure of 
the relevant Coprinus-like laccase or simply on the basis of an 
alignment between the amino acid sequence of the Coprinus-like 
laccase in question with that of the Coprinus laccase used herein 
for identifying the amino acid residues of the respective 
structural elements. 

Furthermore, in connection with Coprinus laccase variants of 
the invention, which are defined by modification of specific 
amino acid residues of the parent Coprinus laccase, it will be 
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understood that variants of Copirinus- like laccases modified in an 
equivalent position (as determined from the best possible amino 
acid sequence alignment between the respective sequences) are 
intended to be covered as well. 

5 

Methods of the invention for design of novel laccase variants 

The analysis or comparison performed in step i) of the 
methods of the invention may be performed by use of any suitable 
computer programme capable of analysing and/or comparing amino 
10 acid sequences . 

The structural part which is identified in step i) of the 
'^'g methods of the invention may be composed of one amino acid 
residue. However, normally the structural part comprises more 
;:Jt than one amino acid residue, typically constituting^^ one of the 
iy.l5 above mentioned parts of the Coprinus structure such as one of 
;L5 the copper centres . 
Modifications 

;=n The modification of an amino acid residue or structural part 

O is typically accomplished by suitable modifications of a DMA 
sequence encoding the parent enzyme in question. The term 
Q "modified" as used in the methods according to the invention is 
intended to have the following meaning: When used in relation to 
an amino acid residue the term is intended to mean replacement of 
the amino acid residue in question with another amino ' acid 
25 residue. When used in relation to a structural part, the term is 
intended to mean: replacement of one or more amino acid residues 
of said structural part with other amino acid residues, or 
addition of one or more amino acid residues to said part, or 
deletion of one or more amino acid residues of said structural 
30 part . 

The construction of the variant of interest is accomplished 
by cultivating a microorganism comprising a DNA sequence encoding 
the variant under conditions which are conducive for producing 
the variant, and optionally subsequently recovering the variant 
35 from the resulting culture broth. This is described in detail 
further below. 
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Variants with altered oxidation potential 

The redox potentials of various wild type laccases have been 
found to be the following (measured at pH 5.3) : 

E°, V vs NHE 

Coprinus cinereus {SEQ ID No. 1): 0.55 
Polyporus pinsitus (SEQ ID No. 2) : 0.79 
Myceliophthora. thermophila (SEQ ID No. 10): 0.48 
Rhizoctonia. solani (SEQ ID No. 7): 0.75 
Scytalidium thermophilum (SEQ ID No. 9) : 0.53 



It is contemplated that it is possible to increase the 
oxidation potential of a parent Coprinus laccase or a parent 
Coprinur- like laccase, wherein said variant is the result of a 
mutation, i.e. one or more amino acid residues have been deleted 
15 from, replaced or added to the parent laccase. Preferred 
positions for mutations are the following: 



Coprinus cinereus laccase (SEQ ID No. 1) 
G411A,V,P,L; I,F,Y,W; 
3 20 G412A,V,P,L, I,F,Y,W; 
V409P,L,I,F,Y,W; 
■ T257A,V,P,L,I,F,Y,W; 
F358Y,W, I; 
T359A,V,P,L,I,F,Y,W; 
25 L48QI,F,Y,W; 
L351I,F,Y,W; 
E473A,V,P,L, I,F,Y,W; 
D98A,V,P,L, I,F,Y,W; 
G131A,V,P,L, I,F,Y,W; 
30 D443A,V,P,L, I,F,Y,W; 
R260 A,V, P,L, I,F,Y,W; in particular 
G411A, V; 
G412A,V,L, I; 
V409L, I; 
35 T257V; 
F358W; 
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T359V, F; 
L480I,F; 
L351I, F; 
E473L,I,F; 
5 D98L,I,F; 
G131V,L, I; 
D443I,F; 
R260I; 

D443I+R260I; 
10 D443F+R260I. 

□ Polyporus pinsitus (SEQ ID No. 2) : 
hJi A390V,P,L, I,F,Y,W; 
W G392A,V,P,L, I,F, Y,W; 
;";i5 E4 6 0D; in particular 
ijl A3 90V, L, I; 
G3 92A,V. 

IM^ Myceliophthora thermophila laccase (SEQ ID No . 10) : 

^^20 G5I1A,V,P,L,I,F,Y,W; 

y T428A,V,P,L, I,F, Y,W; 
S510A,V,P,L, I,F, Y,W; 
D106A,V,P,L, I,F, Y,W; 
N109A,V,P,L,I,F,Y,W,Q; 
25 L500I,F,Y,W; 

A108V, P,L, I, F, Y,W; 
G514A,V,P,L, I,F, Y,W; in particular 
G511A,V,L, I,F; 
T428V; 
30 S510V; 
D106L; 
N109I,F,Q; 
L500F; 
A108V, I; 
35 G514A,V,L, I,F. 

Preferred variants include any combination of the above mentioned 
mutations . 
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Variants with altered pH optimum 

The desired pH optimum of a laccase depends on which 
application is of interest, e.g., if the laccase is to be used 
5 for denim bleaching the preferred pH optimum will be around pH 5- 
8, whereas if the laccase is to be used for washing purposes the 
preferred pH optimum will be around pH 8-10. 

It is contemplated that it is possible to alter the pH 
optimum of a parent Coprinus laccase or a parent Coprinus- like 
10 laccase by creating a mutation in an amino acid within 10 A from 
the active His in the Cul site. 

Preferred positions for mutations are the following: 

Coprinus cinereus laccase (SEQ ID Mo. 1) : 
15 180-181; 
222-224; 
257; 

281-284; 
352-353 ; 
20 357-358; 
409-416; 
470-490. 

Polyporus pinsitus (SEQ ID No. 2) : 
25 E460L, I,F,M,S; 
F463L,M. 

Myceliophthora thermophila (SEQ ID No. 10) : 
192-193 ; 
30 234-235; 
269; 

293-294; 
364-3S5; 
372-373 ; 
35 426-433; 
503-513. 
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Preferred substitutions are the following: E, D, L, I, F, Y, W. 

Variants with altered mediator efficiency 

Laccases are often used in combination with so called 
5 mediators or enhancers, e.g., in combination with phenothiazine 
or phenothiazine related compounds (see WO 95/01426) or in 
combination with acetosyringone or acetosyringone related 
compounds (see WO 96/10079) . 

It is contemplated that it is possible to alter the mediator 
10 efficiency (in order to make the mediator more efficient) , of a 
parent Coprinus laccase or a parent Coprinus- like laccase by 
creating a mutation in an amino acid in one or more of the 
following positions : 



■J15 Coprinus cinereus laccase (SEQ ID No. 1) : 
179-182; 
223 ; 

281-282; 
353-358; 
j20 4 1 0 - 4 12; 
i 4 72; 

474-475; 
477-478. 

25 Preferred substitutions are the following: W, F, M, Y, R, K, S, 
T, N, Q; in particular the following substitutions: 
F358W; 
N478Q. 

30 Myceliophthora theirmophila. laccase (SEQ ID No. 10) : 
185-194; 
235; 

293-294; 
355-373 ; 
35 427-429; 
505; 

507-508 ; 
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510-511. 

Preferred substitutions are in particular one or more of the 
following mutations: 
5 N189G,A, S,T; 
S190G,A; 

F371* (deletion) ; 
F371G,A. 

10 Polyporus pinsitus (SEQ ID No . 2) : 
G392A; 
A461T,S; 
N250Q, Y; 

G165K,R. - 
15 ■ 

Altered O . /OH'-pathwav 

It is contemplated that it is possible to lower the 
possibility of OH" entering the trinuclear Cu site by producing 
J 20 one or more of the following mutations: 

Myceliophthora. thermophila (SEQ ID No . 10) : 
A506E; 
N109D; 
25 H93E; 
H95E; 
M433E; 
M480E. 

30 Polyporus pinsitus (SEQ ID No . 2) : 
F81D,E; 
L112D,E; 
A8 0D,E. 

35 

Methods of preparing laccase variants 

Several methods for introducing • mutations into genes are 
known in the art. After a brief discussion of the cloning of 
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laccase-encoding DNA sequences, methods for generating mutations 
at specific sites within the laccase-encoding sequence will be 
discussed. 

Cloning a DNA secaience encoding- a laccase 
5 The DNA sequence encoding a parent laccase may be isolated 

from any cell or microorganism producing the laccase in question, 
using various methods well known in the art. First, a genomic DNA 
and/or cDNA library should be constructed using chromosomal DNA 
or messenger RNA from the organism that produces the laccase to 
10 be studied. Then, if the amino acid sequence of the laccase is 
known, homologous, labelled oligonucleotide probes may be 
Q synthesized and used to identify laccase-encoding clones from a 
;:Jt genomic library prepared from the organism in question. Alter- 
iy natively, a labei:.Bd oligonucleotide probe containing sequences 
jj|l5 homologous to a known laccase gene could be used as a probe to 
iH identify laccase-encoding clones, using hybridization and washing 
3 conditions of lower stringency. 

M= method for identifying laccase-encoding clones involves 

ry inserting cDNA into an expression vector, such as a plasmid, 
transforming laccase-negative fungi with the resulting cDNA 
O library, and then plating the transformed fungi onto agar 
containing a substrate for laccase, thereby allowing clones 
expressing the laccase to be identified. 

Alternatively, the DNA sequence encoding the enzyme may be 
25 prepared synthetically by established standard methods, e.g. the 
phosphoroamidite method. In the phosphoroamidite method, oligonu- 
cleotides are synthesized, e.g. in an automatic DNA synthesizer, 
purified, annealed, ligated and cloned in appropriate vectors. 

Finally, the DNA sequence may be of mixed genomic and synthe- 
30 tic origin, mixed synthetic and cDNA origin or mixed genomic and 
cDNA origin, prepared by ligating fragments of synthetic, genomic 
or cDNA origin (as appropriate, the fragments corresponding to 
various parts of the entire DNA sequence) , in accordance with 
standard techniques. The DNA sequence may also be prepared by 
35 polymerase chain reaction (PGR) using specific primers. 
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Site-directed mutaaenesis 

Once a laccase-encoding DNA sequence has been isolated, and 
desirable sites for mutation identified, mutations may be intro- 
duced using synthetic oligonucleotides. These oligonucleotides 
contain nucleotide sequences flanking the desired mutation sites; 
mutant nucleotides are inserted during oligonucleotide synthesis. 
In a specific method, a single -stranded gap of DNA, bridging the 
laccase-encoding sequence, is created in a vector carrying the 
laccase gene. Then the synthetic nucleotide, bearing the desired 
mutation, is annealed to a homologous portion of the single- 
stranded DNA. The remaining gap is then filled in with TV DNA 
polymerase and the construct is ligated using T4 ligase. A 
specific example of this method is described in Morinaga et al . 
(1984). us 4,760,025 discloses the .introduction of oligonucleoti- 
15 des encoding multiple mutations by performing minor alterations 
of the cassette. However, an even greater variety of mutations 
can be introduced at any one time by the Morinaga method, because 
a multitude of oligonucleotides, of various lengths, can be 
introduced. 

20 Another method of introducing mutations into laccase-encoding 

DNA sequences is described in Nelson and Long (1989) . It involves 
the 3 -step generation of a PGR fragment containing the desired 
mutation introduced by using a chemically synthesized DNA strand 
as one of the primers in the PGR reactions. From the PCR-gener- 
5 ated fragment, a DNA fragment carrying the mutation may be 
isolated by ' cleavage with restriction endonucleases and 
reinserted into an expression plasmid. 



) 



Random mutagenesis 

The random mutagenesis of a DNA sequence encoding a parent 
laccase may conveniently be performed by use of any method known 
in the art . 

For instance, the random mutagenesis may be perfo2rmed by use 
of a suitable physical or chemical mutagenizing agent, by use of 
a suitable oligonucleotide, or by subjecting the DNA sequence to 
PGR generated mutagenesis. Furthermore, the random mutagenesis 
may be performed by use of any combination of these mutagenizing 
agents . 
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The mutagenizing agent may, e.g., be one which induces tran- 
sitions, transversions, inversions, scrambling, deletions, and/or 
insertions . 

Examples of a physical or chemical mutagenizing agent 
suitable for the present purpose include ultraviolet (UV) ir- 
radiation, hydroxylamine, N-methyl-N' -nitro-N-nitrosoguanidine 
(MNNG) , 0-methyl hydroxyl amine , nitrous acid, ethyl methane 
sulphonate (EMS) , sodium bisulphite, formic acid, and nucleotide 
analogues . 

When such agents are used, the mutagenesis is typically per- 
formed by incubating the DNA sequence encoding the parent enzyme 
to be mutagenized in the presence of the mutagenizing agent of 
choice under suitable conditions for the mutagenesis to take 
place, and selecting for mutated DNA having the desired 
properties . 

When the mutagenesis is performed by the use of an oligo- 
nucleotide, the oligonucleotide may be doped or spiked with the 
three non-parent nucleotides during the synthesis of the 
oligonucleotide at the positions which are to be changed. The 
doping or spiking may be done so that codons for unwanted amino 
acids are avoided. The doped or spiked oligonucleotide can be 
incorporated into the DNA encoding the laccase enzyme by any 
published technique, using e.g. PGR, LCR or any DNA polymerase 
and ligase . 

When PCR-generated mutagenesis is used, either a chemically 
treated or non-treated gene encoding a parent laccase enzyme is 
subjected to PGR under conditions that increase the mis- 
incorporation of nucleotides (Deshler 1992; Leung et al . , 
Technique, Vol.1, 1989, pp. 11-15) . 

A mutator strain of E. coli (Fowler et al . , Molec. Gen. 
Genet., 133, 1974, pp. 179-191), S. cereviseae or any other 
microbial organism may be used for the random mutagenesis of the 
DMA encoding the laccase enzyme by e.g. transforming a plasmid 
containing the parent enzyme into the mutator strain, growing the 
mutator strain with the plasmid and isolating the mutated plasmid 
from the mutator strain. The mutated plasmid may subsequently be 
transformed into the expression' organism. 
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The DNA sequence to be mutagenized may conveniently be 
present in a genomic or cDNA library prepared from an organism 
expressing the parent laccase enzyme. Alternatively, the DNA se- 
quence may be present on a suitable vector such as a plasmid or a 
5 bacteriophage, which as such may be incubated with or otherwise 
exposed to the mutagenizing agent. The DNA to be mutagenized may 
also be present in a host cell either by being integrated in the 
genome of said cell or by being present on a vector harboured in 
the cell. Finally, the DNA to be mutagenized may be in isolated 
10 form. It will be understood that the DNA sequence to be subjected 
to random mutagenesis is preferably a cDNA or a genomic *DNA 
□ sequence . 

■irl In some cases it may be convenient to amplify the mutated DNA 

Ly sequence prior to the expression step or the screening step being 
:'V-5 performed. Such amplification may be performed in accordance with 
Lfi methods known in the art, the presently preferred method being 
PCR-generated amplification using oligonucleotide primers 
k,,i, prepared on the basis of the DNA or amino acid sequence of the 
\ parent enzyme . 

CSO Subsequent to the incubation with or exposure to the mutage- 

nizing agent, the mutated DNA is expressed by culturing a 
suitable host cell carrying the DNA sequence under conditions 
allowing expression to take place. The host cell used for this 
purpose may be one which has been transformed with the mutated 
25 DNA sequence, optionally present on a vector, or one which was 
carried the DNA sequence encoding the parent enzyme during the 
mutagenesis treatment. Examples of suitable host cells are fungal 
hosts such as Aspergillus niger or Aspergillus oryzae. 

30 ■ The mutated DNA sequence may further comprise a DNA sequence 
encoding functions permitting expression of the mutated DNA 
sequence . 

Localized random mutagenesis 
35 The random mutagenesis may advantageously be localized to a 

part of the parent laccase in question. This may, e.g., .be 
advantageous when certain regions of the enzyme have been 
identified to be of particular importance for a given property of 
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the enzyme, and when modified are expected to result in a variant 
having improved properties. Such regions may normally be 
identified when the tertiary structure of the parent enzyme has 
been elucidated and related to the function of the enzyme. 

The localized random mutagenesis is conveniently performed by 
use of PCR-generated mutagenesis techniques as described above or 
any other suitable technique known in the art. 

Alternatively, the DNA sequence encoding the part of the DNA 
sequence to be modified may be isolated, e.g. by being inserted 
into a- suitable vector, and said part may subsequently be subjec- 
ted to mutagenesis by use of any of the mutagenesis methods 
discussed above. 

With respect to the screening step in the above-mentioned 
method of the invention, this may conveniently be performed by 
use of aa filter assay based on the following principle: 

A microorganism capable of expressing the mutated laccase 
enzyme of interest is incubated on a suitable medium and under 
suitable conditions for the enzyme to be secreted, the medium 
being provided with a double filter comprising a first protein- 
binding filter and on top of that a second filter exhibiting a 
low protein binding capability. The microorganism is located on 
the second filter. Subsequent to the incubation, the first filter 
comprising enzymes secreted from the microorganisms is separated 
from the second filter comprising the microorganisms. The first 
filter is siibjected to screening for the desired enzymatic 
activity and the corresponding microbial colonies present on the 
second filter are identified. 

The filter used for binding the enzymatic activity may be any 
protein binding filter e.g. nylon or nitrocellulose. The top 
filter carrying the colonies of the expression organism may be 
any filter that has no or low affinity for binding proteins e.g. 
cellulose acetate or Durapore™. The filter may be pretreated with 
any of the conditions to be used for screening or may be treated 
during the detection of enzymatic activity. 

The enzymatic activity may be detected by a dye, 
fluorescence, precipitation, pH indicator, IR-absorbance or any 
other known . technique for detection of enzymatic activity. 

The detecting compound may be immobilized by any immobilizing 
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agent, e.g., agarose, agar, gelatine, polyacrylamide, starch, 
filter paper, cloth; or any combination of immobilizing agents. 

Laccase activity 

5 In the context of this invention, the laccase activity was 

measured using 10- (2-hydroxyethyl) -phenoxazine (HEPO) as 
substrate for the various laccases. HEPO was synthesized using 
the same procedure as described for 10- (2-hydroxyethyl) - 
phenothiazine, (G.' Cauquil in Bulletin de la Society Chemique de 
10 France, 1960, p. 1049) . In the presence of oxygen laccases (E.G. 
1.10.3.2) oxidize HEPO to a HEPO radical that can be monitored 
□ photometrically at 52 8 nm. 

Q The Coprinus cinereus laccase and the Polyporus pinsitus 

m laccase were measured using 0.4 mM HEPO in 50 mM MES-NaOH, pH 
ijs 5.5. The absorbance at 528 nm was followed for 200 s and the 
m rate calculated from the linear part of the progress curve. 

The Myceliophthora. thermophila. laccase was measured using 
i-^ 0.4 mM HEPO in 25 mM Tris-HCl, pH 7.5, 0.05% Tween-20 at 3 0 °C. 
g The absorbance at 52 8 nm was followed for 200 s and the rate 
■■^40 calculated from the linear part of the progress curve. 

Expression o£ laccase variants 

According to the invention, a DNA sequence encoding the 
variant produced by methods described above, or by any alterna- 
25 tive methods known in the art, can be expressed, in enzyme form, 
using an expression vector which typically includes control 
sequences encoding a promoter, operator, ribosome binding site, 
translation initiation signal, and, optionally, a repressor gene 
or various activator genes . 
30 The recombinant expression vector carrying the DNA sequence 

encoding a laccase variant of the invention may be any vector 
which may conveniently be subjected to recombinant DNA 
procedures, and the choice of vector will often depend on the 
host cell into which it is to be introduced. Thus, the vector may 
35 be. an autonomously replicating vector, i.e. a vector which exists 
as an extrachromosomal entity, the replication of which .is 
independent of chromosomal replication, e.g. a plasmid, a 
bacteriophage or an extrachromosomal element, minichromosome or 
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an artificial chromosome. Alternatively, the vector may be one 
which, when introduced into a host cell, is integrated into the 
host cell genome and replicated together with the chromosome (s) 
into which it has been integrated. 

In the vector, the DNA sequence should be operably connected 
to a suitable promoter sequence. The promoter may be any DNA 
sequence which shows transcriptional activity in the host cell of 
choice and may be derived from genes encoding proteins either 
homologous or heterologous to the host cell. Examples of suitable 
promoters for directing the transcription of the DNA sequence 
encoding a laccase variant of the invention, especially in a 
O fungal host, are those derived from the gene encoding A. oryzae 
Q TAKA amylase, Rhizomucor miehei aspartic proteinase, A. niger 
|:| neutral a-amylase, A. niger acid stable a-amylase, A. niger glu- 
W15 coamylase, Rhizomucor miehei lipase, A. oryzae alkaline protease, 
Q A. oryzae triose phosphate isomerase or A. nidulans acetamidase . 

The expression vector of the invention may also comprise a 
ry suitable transcription terminator and, in eukaryotes, poly- 
;==j adenylation sequences operably connected to the DNA sequence 
encoding the laccase variant of the invention. Termination and 
polyadenylation sequences may suitably be derived from the same 
sources as the promoter. 

The vector may further comprise a DNA sequence enabling the 
vector to replicate in the host cell in question. Examples of 
such sequences- are the origins of replication of plasmids pUC19, 
pACYClVV, pUBllO, pE194, pAMBl and pIJ702 . 

The vector may also comprise a selectable marker, e.g. a 
gene, the product of which complements a defect in the host cell, 
such as one which confers antibiotic resistance such as ampicil- 
lin, kanamycin, chloramphenicol or tetracyclin resistance. Fur- 
thermore, the vector may comprise Aspergillus selection markers 
such as amdS, argB, niaD and sC, a marker giving rise to 
hygromycin resistance, or the selection may be accomplished by 
CO -transformation, e.g. as described in WO 91/17243. 

The procedures used to ligate the DNA construct of the inven- 
tion encoding a laccase variant, the promoter, terminator and 
other elements, respectively, and to insert them into suitable 



120 



25 



30 



35 



5200-wo 

20 

vectors containing the information necessary for replication, are 
well known to persons skilled in the art (cf., for instance, 
Sambrook et al . (1989)). 

The cell of the invention, either comprising a DNA construct 
3 or an expression vector of the invention as defined above, is 
advantageously used as a host cell in the recombinant production 
of a laccase variant of the invention. The cell may be 
transformed with the DNA construct of the invention encoding the 
variant, conveniently by integrating the DNA construct (in one or 
10 more copies) in the host chromosome. This integration is 
generally considered to be an advantage as the DNA sequence is 
more likely to be stably maintained in the cell. Integration of 
the DNA constructs into the host chromosome may be performed 
according to conventional --methods, e.g. by homologous or 
15 heterologous recombination. Alternatively, the cell may be 
transformed with an expression vector as described above ' in 
connection with the different types of host cells. 

The cell of the invention may be a cell of a higher organism 
such as a mammal or an insect, but is preferably a microbial 
20 cell, e.g. a fungal cell. 

The filamentous fungus may advantageously belong to a species 
of Aspergillus, e.g. Aspergillus oryzae or Aspergillus niger. 
Fungal cells may be transformed by a process involving protoplast 
formation and transformation of the protoplasts followed by 
25 regeneration of the cell wall in a manner known per se. A 
suitable procedure for transformation of Aspergillus host cells 
is described in EP 238 023. 

In a yet further aspect, the present invention relates to a 
method of producing a laccase variant of the invention, which 
30 method comprises cultivating a host cell as described above under 
conditions conducive to the production of the variant and 
recovering the variant from the cells and/or culture medium. 

The medium used to cultivate the cells may be any convention- 
al medium suitable for growing the host cell in question and 
35 obtaining expression of the laccase variant of the invention. 
Suitable media are available from commercial suppliers or may -be 
prepared according to published recipes (e.g. as described in 
catalogues of the American Type Culture Collection). 
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The laccase variant secreted from the host cells may con- 
veniently be recovered from the culture medium by well-known 
procedures, including separating the cells from the medium by 
centrifugation or filtration, and precipitating proteinaceous 
components of the medium by means of a salt such as ammonium 
sulphate, followed by the use of chromatographic procedures such 
as ion exchange chromatography, affinity chromatography, or the 
like. 

Industrial Applications 

The laccase variants of this invention possesses valuable 
properties allowing for various industrial applications, in 
particular lignin modification, paper strengthening, dye transfer 
inhibition in detergents, phenol polymerization, hair dyeing, - 
textile dyeing, bleaching of textiles (in particular bleaching of 
denim as described in WO 96/12845 and WO 96/12846} and waste 
water treatment . 
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Appendix 1 : 

SEQRES 1 A 504 GLN ILE VAL ASN SER VAL ASP THR MET THR LEU THR ASN 
SEQRES 2 A 5 04 ALA ASN VAL SER PRO ASP GLY PHE THR ARG ALA GLY ILE 

5 

SEQRES 3 A 504 LEU VAL ASN GLY VAL HIS GLY PRO LEU ILE ARG GLY GLY 

SEQRES 4 A 5 04 LYS ASN ASP ASN PHE GLU LEU ASN VAL VAL ASN ASP LEU 

10 SEQRES 5 A 504 ASP ASN PRO THR MET LEU ARG PRO THR SER ILE HIS TRP 

SEQRES S A 5 04 HIS GLY LEU PHE GLN ARG GLY THR ASN TRP ALA ASN GLY 
SEQRES 7 A 5 04 ALA ASP GLY VAL ASN GLN CYS PRO ILE SER PRO GLY HIS 

15 SEQRES 8 A 504 ALA PHE LEU TYR LYS PHE THR PRO ALA GLY HIS ALA GLY 

SEQRES 9 A 504 THR PHE TRP TYR HIS SER HIS PHE GLY THR GLN TYR CYS 

SEQRES 10 A 5 04 ASP GLY LEU ARG GLY PRO MET VAL ILE TYR ASP ASP ASN 

20 

SEQRES 11 A 5 04 ASP PRO HIS ALA ALA LEU TYR ASP GLU ASP ASP GLU ASN 

- SEQRES 12 A 5 04 THR ILE ILE THR LEU ALA ASP TRP TYR HIS ILE PRO ALA 

25 SEQRES 13 A 5 04 PRO SER ILE GLN GLY ALA ALA GLN PRO ASP ALA THR LEU 

SEQRES 14 A 504 ILE ASN GLY LYS GLY ARG TYR VAL GLY GLY PRO ALA ALA 

SEQRES 15 A 504 GLU LEU SER ILE VAL ASN VAL GLU GLN GLY LYS LYS TYR 

30 

SEQRES 16 A 5 04 ARG MET ARG LEU ILE SER LEU SER CYS ASP PRO ASN TRP 

SEQRES 17 A 504 GLN PHE SER ILE ASP GLY HIS GLU LEU THR ILE ILE GLU 

35 SEQRES 18 A 504 VAL ASP GLY ASN LEU THR GLU PRO HIS THR VAL ASP ARG 

SEQRES 19 A 5 04 LEU GLN ILE PHE THR GLY GLN ARG TYR SER PHE VAL LEU 

SEQRES 20 A 5 04 ASP ALA ASN GLN PRO VAL ASP ASN TYR TRP ILE ARG ALA 

SEQRES 21 A 5 04 GLN PRO ASN LYS GLY ARG ASN GLY LEU ALA GLY THR PHE 

SEQRES 22 A 5 04 ALA ASN GLY VAL ASN SER ALA ILE LEU ARG TYR ALA GLY 

40 SEQRES 23 A 504 ALA ALA ASN ALA ASP PRO THR THR SER ALA ASN PRO ASN 

SEQRES 24 A 5 04 PRO ALA GLN LEU ASN GLU ALA ASP LEU HIS ALA LEU ILE 

SEQRES 25 A 5 04 ASP PRO ALA ALA PRO GLY ILE PRO THR PRO GLY ALA ALA 

SEQRES 26 A 504 ASN VAL ASN LEU ARG PHE GLN LEU GLY PHE SER GLY GLY 

SEQRES 27 A 5 04 ARG PHE THR ILE ASN GLY THR ALA TYR GLU SER PRO SER 

45 SEQRES 28 A 5 04 VAL PRO THR LEU LEU GLN ILE MET SER GLY ALA GLN SER 
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0 


-3 


.838 


40 


. 281 


1 


. 903 


1 


.00 


38 


. 67 




ATOM 


4126 


owo 


WAT 


w 


255 


0 


6 


.837 


35 


. 163 


51 


. 935 


1 


.00 


58 


.57 




ATOM 


4127 


owo 


WAT 


w 


256 


0 


19 


.740 


62 


. 853 


17 


. 880 


1 


. 00 


52 


.39 




ATOM 


4128 


owo 


WAT 


w 


258 


0 


-0 


. 994 


41 


. 755 


22 


. 088 


0 


. 00 


69 


. 57 


10 


ATOM 


4129 


owo 


WAT 


w 


259 


0 


1 


.221 


10 


. 473 


15 


. 458 


1 


. 00 


54 


.80 




ATOM 


4130 


owo 


WAT 


w 


260 


0 


23 


.445 


55 


. 367 


31 


.430 


1 


. 00 


48 


.90 




ATOM 


4131 


owo 


WAT 


w 


261 


0 


23 


. 757 


57 


. 854 


34 


. 657 


1 


. 00 


37 


.69 




ATOM 


4132 


owo 


WAT 


w 


262 


0 


8 


.508 


19 


.111 


34 


. 572 


1 


. 00 


55 


.52 




ATOM 


4133 


owo 


WAT 


w 


263 


0 


22 


.806 


22 


.381 


3 


. 611 


1 


. 00 


64 


.20 


15 


ATOM 


4134 


owo 


WAT 


w 


264 


0 


0 


.398 


22 


. 602 


42 


. 525 


1 


. 00 


58 


.85 




ATOM 


4135 


owo 


WAT 


w 


265 


0 


4 


.195 


52 


.287 


43 


.465 


1 


.00 


36 


. 84 




ATOM 


413S 


owo 


WAT 


w 


266 


0 


20 


. 211 


6 


.536 


4 


. 911 


1 


. 00 


39 


.34 




ATOM 


4137 


owo 


WAT 


w 


267 


0 


14 


. 6eo 


16 


. 117 


2 


. 803 


1 


. 00 


45 


. 76 




ATOM 


4138 


owo 


WAT 


w 


263 


0 


14 


. 938 


25 


. 582 


6 


. 850 


1 


. 00 


41 


. 01 


20 


ATOM 


4139 


owo 


WAT 


w 


269 


0 


7 


. 763 


7 


. 940 


31 


. 891 


0 


. 00 


71 


. 30 




ATOM 


4140 


owo 


WAT 


w 


270 


0 


-3 


. 459 


33 


.491 


39 


. 400 


1 


. 00 


40 


. 80 




ATOM 


4141 


owo 


WAT 


w 


271 


0 


23 


.154 


22 


. 897 


6 


. 985 


1 


00 


48 


. 25 




ATOM 


4142 


owo 


WAT 


w 


272 


0 


34 


. 916 


25 


. 555 


28 


. 092 


1 


00 


52 


. 63 




ATOM 


4143 


owo 


WAT 


w 


273 


0 


3 


. 332 


45 


. 481 


50 


. 776 


1 


00 


47 


. 23 


25 


ATOM 


4144 


owo 


WAT 


w 


274 


0 


-3 


. 441 


57 


. 643 


28 


. 775 


1 


00 


49 


70 




ATOM 


4145 


owo 


WAT 


w 


275 


0 


23 


.213 


40 


.573 


47 


.561 


1 


00 


56 


02 




ATOM 


4146 


owo 


WAT 


w 


276 


0 


5 


.421 


55 


179 


45 


. 172 


1 


00 


52 


70 




ATOM 


4147 


owo 


WAT 


w 


277 


0 


-3 


012 


21 


908 


40 


. 933 


1 


00 


41 


69 




ATOM 


4148 


owo 


WAT 


w 


278 


0 


26 


328 


53 


637 


17 


. 905 


1. 


00 


37 


80 


30 


ATOM 


4149 


owo 


WAT 


w 


279 


0 


9 


740 


58 


922 


43 


485 


1. 


00 


52 


06 




ATOM 


4150 


owo 


_WAT 


w 


280 


0 


23 


545 


15, 


660 


4 


258 


1. 


00 


41 


55 




ATOM 


4151 


owo 


WAT 


w 


231 


0 


22 


652 


31. 


154 


51 


246 


1. 


00 


53 . 


65 




ATOM 


4152 


owo 


WAT 


w 


282 


0 


22 


192 


51. 


135 


8 


251 


1. 


00 


44 . 


76 




ATOM 


4153 


owo 


WAT 


w 


2.83 


0 


-6 . 


046 


22 . 


886 


24 


288 


1. 


00 


52 . 


40 


35 


ATOM 


4154 


owo 


WAT 


w 


284 


0 


19. 


949 


45 . 


27S 


49 


516 


1 . 


00 


54 . 


58 




ATOM 


4155 


owo 


WAT 


w 


285 


0 


7 . 


388 


22 . 


308 


32 


108 


1 . 


00 


43 . 


62 




ATOM 


4156 


owo 


WAT 


w 


286 


0 


15 . 


080 


50. 


452 


2 . 


795 


1 . 


00 


52 . 


20 




ATOM 


4157 


owo 


WAT 


w 


287 


0 


1 . 


016 


62 . 


235 


30 . 


878 


1 . 


00 


56 . 


81 




ATOM 


4158 


owo 


WAT 


w 


288 


0 


23 . 


803 


52 . 


570 


27. 


699 


1 . 


00 


56 . 


22 


40 


ATOM 


4159 


owo 


WAT 


w 


289 


0 


-10 . 


525 


31. 


623 


13 . 


870 


1 . 


00 


47 . 


21 




ATOM 


4160 


owo 


WAT 


w 


290 


0 


1 . 


599 


55 . 


502 


24 . 


567 


1 . 


00 


44 . 


50 




ATOM 


4161 


owo 


WAT 


w 


291 


0 


-15 . 


671 


37. 


251 


14 . 


660 


1. 


00 


83 . 


62 




ATOM 


4162 


owo 


WAT 


w 


292 


0 


7 . 


231 


7. 


950 


17 . 


754 


1 . 


00 


50 . 


61 




ATOM 


4163 


owo 


WAT 


w 


293 


0 


-4 . 


009 


34 . 


057 


42 . 


492 


1 . 


00 


78 . 


48 


45 


ATOM 


4164 


owo 


WAT 


w 


294 


0 


21 . 


004 


58 . 


371 


18 . 


690 


1 . 


00 


61 . 


15 
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ATOM 


4165 


OWO 


WAT 


W 


295 


0 


15 


.405 


48 


869 


52 


.211 


1 


.00 


53 


17 


ATOM 


4166 


OWO 


WAT 


W 


295 


0 


7 


.329 


31 


202 


1 


.964 


1 


.00 


38 


86 


ATOM 


4167 


OWO 


WAT 


W 


297 


0 


9 


.518 


53 


886 


5 


.457 


1 


. 00 


41 


52 


ATOM 


4158 


OWO 


WAT 


W 


299 


0 


10 


.398 


48 


995 


0 


.335 


1 


. 00 


49 


64 


ATOM 


4169 


OWO 


WAT 


W 


299 


0 


9 


.889 


15 


077 


3 


.774 


1 


. 00 


42 


28 


ATOM 


4170 


OWO 


WAT 


W 


300 


0 


15 


.854 


56 


731 


10 


.934 


1 


. 00 


44 


02 



10 



a 

m 

m 
m 
Q 

:: 

m 

Q 

a 
Q 



5200-wo 



117 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: NOVO NORDISK 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 
10 (E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2800 

(G) TELEPHONE: +45 44 44 8 8 8 8 

(H) TELEFAX: +45 44 49 05 55 

15 (ii) TITLE OF INVENTION: LACCASE MUTANTS 

(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER READABLE FORM: 
20 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.3 0 (EPO) 



25 



35 



45 



50 



65 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 9 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1; 



Met Phe Lys Asn Leu Leu Ser Phe Ala Leu Leu Ala lie Ser Val Ala 
40 1 5 10 15 



Asn Ala Gin lie Val Asn Ser Val Asp Thr Met Thr Leu Thr Asn Ala 
20 25 30 

Asn Val Ser Pro Asp Gly Phe Thr Arg Ala Gly lie Leu Val Asn Gly 
35 ^ 40 45 

Val His Gly Pro Leu lie Arg Gly Gly Lys Asn Asp Asn Phe Glu Leu 
50 55 60 

Asn Val Val Asn Asp Leu Asp Asn Pro Thr Met Leu Arg Pro Thr Ser 
S5 70 75 30 



He His Trp His Gly Leu Phe Gin Arg Gly Thr Asn Trp Ala Asp Gly 
55 85 90 95 

Ala Asp Gly Val Asn Gin Cys Pro He Ser Pro Gly His Ala Phe Leu 
100 105 110 

60 Tyr Lys Phe Thr Pro Ala Gly His Ala Gly Thr Phe Trp Tyr His Ser 

115 120 125 



His Phe Gly Thr, Gin Tyr Cys Asp Gly Leu Arg Gly Pro Met Val He 
130 135 140 

Tyr Asp Asp Asn Asp Pro His Ala Ala Leu Tyr Asp Glu Asp Asp Glu 

145 150 155 ISO 

Asn Thr He He Thr Leu Ala Asp Trp Tyr His lie Pro Ala Pro Ser 
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165 170 175 

lie Gin Gly Ala Ala Gin Pro Asp Ala Thr Leu lie Asn Gly Lys Gly 
180 185 190 

Arg Tyr Val Gly Gly Pro Ala Ala Glu Leu Ser lie Val Asn Val Glu 
195 200 205 

Gin Gly Lys Lys Tyr Arg Met Arg Leu lie Ser Leu Ser Cys Asp Pro 
210 215 220 

Asn Trp Gin Phe Ser lie Asp Gly His Glu Leu Thr lie He Glu Val 
225 230 235 240 

Asp Gly Gin Leu Thr Glu Pro His Thr Val Asp Arg Leu Gin He Phe 
245 250 255 

Thr Gly Gin Arg Tyr Ser Phe Val Leu Asp Ala Asn Gin Pro Val Asp 
2S0 255 270 

Asn Tyr Trp He Arg Ala Gin Pro Asn Lys Gly Arg Asn Gly Leu Ala 
275 280 285 

Gly Thr Phe Ala Asn Gly Val Asn Ser Ala lie Leu Arg Tyr Ala Gly 
290 295 - 300 

Ala Ala Asn Ala Asp Pro Thr Thr Ser Ala Asn Pro Asn Pro Ala Gin 
305 310 315 320 

Leu Asn Glu Ala Asp Leu His Ala Leu lie Asp Pro Ala Ala Pro Gly 
325 330 335 

lie Pro Thr Pro Gly Ala Ala Asp Val Asn Leu Arg Phe Gin Leu Gly 
340 345 

Phe Ser Gly Gly Arg Phe Thr He Asn Gly Thr Ala Tyr Glu Ser Pro 
355 360 365 

Ser Val Pro Thr Leu Leu Gin He Met Ser Gly Ala Gin Ser Ala Asn 
370 375 380 

Asp Leu Leu Pro Ala Gly Ser Val Tyr Glu Leu Pro Arg Asn Gin Val 
385 390 395 400 

Val Glu Leu Val Val Pro Ala Gly Val Leu Gly Gly Pro His Pro Phe 
405 410 415 

His Leu His Gly His Ala Phe Ser Val Val Arg Ser Ala Gly Ser Ser 
420 425 430 

Thr Tyr Asn Phe Val Asn Pro Val Lys Arg Asp Val Val Ser Leu Gly 
435 440 445 

Val Thr Gly Asp Glu Val Thr He Arg Phe Val Thr Asp Asn Pro Gly 
450 455 460 

Pro Trp Phe Phe His Cys His He Glu Phe His Leu Met Asn Gly Leu 
465 470 475 430 

Ala He Val Phe Ala Glu Asp Met Ala Asn Thr Val Asp Ala Asn Asn 
485 490 495 

Pro Pro Val Glu Trp Ala Gin Leu Cys Glu He Tyr Asp Asp Leu Pro 
500 505 510 



Pro Glu Ala Thr Ser He Gin Thr Val Val Arg Arg Ala Glu Pro Thr 
515 520 525 
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Gly Phe Ser Ala Lys Phe Arg Arg Glu Gly Leu 
530 535 ' 

5 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 9 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



20 



35 



50 



65 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Gly He Gly Pro Val Ala Asp Leu Thr He Thr Asn Ala Ala Val Ser 
15 10 15 

Pro Asp Gly Phe Ser Arg Gin Ala Val Val Val Asn Gly Gly Thr Pro 
20 25 30 



Gly Pro Leu He Thr Gly Asn Met Gly Asp Arg Phe Gin Leu Asn Val 
25 35 40 45 

He Asp Asn Leu Thr Asn His Thr Met Leu Lys Ser Thr Ser He His 

50 55 60 

30 Trp His Gly Phe Phe Gin Lys Gly Thr Asn Trp Ala Asp Gly Pro Ala 

65 70 75 80 



Phe He Asn Gin Cys Pro He Ser Ser Gly His Ser Phe Leu Tyr Asp 
85 90 95 

Phe Gin Val Pro Asp Gin Ala Gly Thr Phe Trp Tyr His Ser His Leu 
100 105 110 



Ser Thr Gin Tyr Cys Asp Gly Leu Arg Gly Pro Phe Val Val Tyr Asp 
40 115 120 125 

Pro Asn Asp Pro Ala Ala Asp Leu Tyr Asp Val Asp Asn Asp Asp Thr 
130 135 140 

45 Val He Thr Leu Val Asp Trp Tyr His Val Ala Ala Lys Leu Gly Pro 

145 . 150 155 160 



Ala Phe Pro Leu Gly Ala Asp Ala Thr Leu He Asn Gly Lys Gly Arg 
165 170 175 

Ser Pro Ser Thr Thr Thr Ala Asp Leu Ser Val He Ser Val Thr Pro 
180 185 190 



Gly Lys Arg Tyr Arg Phe Arg Leu Val Ser Leu Ser Cys Asp Pro Asn 
55 195 200 205 

Tyr Thr Phe Ser He Asp Gly His Asn Met Thr He He Glu Thr Asp 
210 215 220 

60 Ser He Asn Thr Ala Pro Leu Val Val Asp Ser He Gin He Phe Ala 

225 230 235 240 



Ala Gin Arg Tyr Ser Phe Val Leu Glu Ala Asn Gin Ala Val Asp Asn 
245 250 255 

Tyr Trp He Arg Ala Asn Pro Asn Phe Gly Asn Val Gly Phe Thr Gly 
260 265 270 
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Gly lie Asn Ser Ala lie Leu Arg Tyr Asp Gly Ala Ala Ala Val Glu 
275 280 285 

Pro Thr Thr Thr Gin Thr Thr Ser Thr Ala Pro Leu Asn Glu Val Asn 
290 295 300 

Leu His Pro Leu Val Thr Thr Ala Val Pro Gly Ser Pro Val Ala Gly 
305 310 315 320 

Gly Val Asp Leu Ala lie Asn Met Ala Phe Asn Phe Asn Gly Thr Asn 
325 330 335 

Phe Phe lie Asn Gly Ala Ser Phe Thr Pro Pro Thr Val Pro Val Leu 
340 345 350 

Leu Gin He He Ser Gly Ala Gin Asn Ala Gin Asp Leu Leu Pro Ser 
355 360 365 

Gly Ser Val Tyr Ser Leu Pro Ser Asn Ala Asp He Glu He Ser Phe 
20 370 375 380 

Pro Ala Thr Ala Ala Ala Pro Gly Ala Pro His Pro Phe His Leu His 
385 390 395 400 

Gly His Ala Phe Ala Val Va\: Arg Ser Ala Gly Ser Thr Val Tyr Asn 
405 410 415 

Tyr Asp Asn Pro He Phe Arg Asp Val Val Ser Thr Gly Thr Pro Ala 
420 425 430 

Ala Gly Asp Asn Val Thr He Arg Phe Arg Thr Asp Asn Pro Gly Pro 
435 440 445 

Trp Phe Leu His Cys His He Asp Phe His Leu Glu Ala Gly Phe Ala 
35 450 455 460 

Val Val Phe Ala Glu Asp He Pro Asp Val Ala Ser Ala Asn Pro Val 
465 470 475 480 

Pro Gin Ala Trp Ser Asp Leu Cys Pro Thr Tyr Asp Ala Leu Asp Pro 
485 490 ■" 495 

Ser Asp Gin 



25 



30 



40 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

Ala He Gly Pro Val Ala Ser Leu Val Val Ala Asn Ala Pro Val Ser 
60 1 5 10 15 

Pro Asp Gly Phe Leu Arg Asp Ala He Val Val Asn Gly Val Val Pro 
20 25 30 

65 Ser Pro Leu He Thr Gly Lys Lys Gly Asp Arg Phe Gin Leu Asn Val 

35 40 45 

Val Asp Thr Leu Thr Asn His Ser Met Leu Lys Ser Thr Ser He His 
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50 55 60 

Trp His Gly Phe Phe Gin Ala Gly Thr Asn Trp Ala Glu Gly Pro Ala 
65 70 75 80 

Phe Val Asn Gin Cys Pro lie Ala Ser Gly His Ser Phe Leu Tyr Asp 
85 90 95 

Phe His Val Pro Asp Gin Ala Gly Thr Phe Trp Tyr His Ser His Leu 
100 105 110 

Ser Thr Gin Tyr Cys Asp Gly Leu Arg Gly Pro Phe Val Val Tyr Asp 
115 120 125 

Pro Lys Asp Pro His Ala Ser Arg Tyr Asp Val Asp Asn Glu Ser Thr 
130 135 140 

Val lie Thr Leu Thr Asp Trp Tyr His Thr Ala Ala Arg Leu Gly Pro 
145 150 155 160 

Lys Phe Pro Leu Gly Ala Asp Ala Thr Leu lie Asn Gly Leu Gly Arg 
165 170 175 

Ser Ala Ser Thr Pro Thr Ala Ala Leu Ala Val lie Asn Val Gin His 
180 185 iJ 190 

Gly Lys Arg Tyr Arg Phe Arg Leu Val Ser lie Ser Cys Asp Pro Asn 
195 200 205 

Tyr Thr Phe Ser lie Asp Gly His Asn Leu Thr Val lie Glu Val Asp 
210 215 220 

Gly lie Asn Ser Gin Pro Leu Leu Val Asp Ser He Gin He Phe Ala 
225 230 235 240 

Ala Gin Arg Tyr Ser Phe Val Leu Asn Ala Asn Gin Thr Val Gly Asn 
245 250 255 

Tyr Trp Val Arg Ala Asn Pro Asn Phe Gly Thr Val Gly Phe Ala Gly 
250 265 270 

Gly lie Asn Ser Ala He Leu Arg Tyr Gin Gly Ala Pro Val Ala Glu 
275 280 285 

Pro Thr Thr Thr Gin Thr Pro Ser Val lie Pro Leu- He Glu Thr Asn 
290 _ 295 300 

Leu His Pro Leu Ala Arg Met Pro Val Pro Gly Ser Pro Thr Pro Gly 
305 310 315 320 

Gly Val Asp Lys Ala Leu Asn Leu Ala Phe Asn Phe Asn Gly Thr Asn 
325 330 335 

Phe Phe He Asn Asn Ala Thr Phe Thr Pro Pro Thr Val Pro Val Leu 
340 345 350 

Leu Gin He Leu Ser Gly Ala Gin Thr Ala Gin Asp Leu Leu Pro Ala 
355 360 365 

Gly Ser Val Tyr Pro Leu Pro Ala His Ser Thr He Glu He Thr Leu 
370 375 380 



Pro Ala Thr Ala 
385 

Gly His Ala Phe 



Leu Ala Pro Gly 
390 

Ala Val Val Arg 
405 



Ala Pro His Pro 
395 

Ser Ala Gly Ser 
410 



Phe His Leu His 
400 

Thr Thr Tyr Asn 
415 
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Tyr Asn Asp Pro lie Phe Arg Asp Val Val Ser Thr Gly Thr Pro Ala 
420 425 430 

Ala Gly Asp Asn Val Thr lie Arg Phe Gin Thr Asp Asn Pro Gly Pro 
435 440 445 

Trp Phe Leu His Cys His lie Asp Phe His Leu Asp Ala Gly Phe Ala 
450 455 4S0 

He Val Phe Ala Glu Asp Val Ala Asp Val Lys Ala Ala Asn Pro Val 
455 470 475 480 



Pro Lys Ala Trp Ser Asp Leu Cys Pro He Tyr Asp Gly Leu Ser Glu 
15 485 490 495 

Ala Asn Gin 

20 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 amino acids 

(B) TYPE: amino acid 

25 (C) STH:\OT)EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

Met His Thr Phe Leu Arg Ser Thr Ala Leu Val Val Ala Gly Leu Ser 
15 10 15 

Ala Arg Ala Leu Ala Ser He Gly Pro Val Thr Asp Phe His He Val 
20 25 30 



Asn Ala Ala Val Ser Pro Asp Gly Phe Ser Arg Gin Ala Val Leu Ala 
40 35 40 45 



Glu Gly Val Phe Pro Gly Pro Leu He Ala Gly Asn Lys Gly Asp Asn 
50 55 60 

Phe Gin He Asn Val He Asp Glu Leu Thr Asn Ala Thr Met Leu Lys 
65 _ 70 75 80 

Thr Thr Thr He His Trp His Gly Phe Phe Gin His Gly Thr Asn Trp 
85 90 95 

Ala Asp Gly Pro Ala Phe He Asn Gin Cys Pro He Ala Ser Gly Asp 
100 105 110 



Ser Phe Leu Tyr Asn Phe Gin Val Pro Asp Gin Ala Gly Thr Phe Trp 
55 115 120 125 

Tyr His Ser His Leu Ser Thr Gin Tyr Cys Asp Gly Leu Arg Gly Pro 
130 135 140 

60 Phe Val Val Tyr Asp Pro Ala Asp Pro Tyr Leu Asp Gin Tyr Asp Val 

145 150 155 160 



Asp Asp Asp Ser Thr Val He Thr Leu Ala Asp Trp Tyr His Thr Ala 

165 170 175 

Ala Arg Leu Gly Ser Pro Phe Pro Ala Ala Asp Thr Thr Leu He Asn 
180 185 190 
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Gly Leu Gly Arg Cys Gly Glu Ala Gly Cys Pro Val Ser Asp Leu Ala 
195 200 205 

Val lie Ser Val Thr Lys Gly Lys Arg Tyr Arg Phe Arg Leu Val Ser 
210 215 220 

lie Ser Cys Asp Ser Phe Phe Thr Phe Ser lie Asp Gly His Ser Leu 
225 230 235 240 

Asn Val lie Glu Val Asp Ala Thr Asn His Gin Pro Leu Thr Val Asp 
245 250 255 

Glu Leu Thr lie Tyr Ala Gly Gin Arg Tyr Ser Phe lie Leu Thr Ala 
260 265 270 

Asp Gin Asp Val Asp Asn Tyr Trp lie Arg Ala Asn Pro Gly lie Gly 
275 280 285 



lie Thr Thr Gly Phe Ala Gly Gly lie Asn Ser Ala lie Leu Arg Tyr 
20 290 295 300 

Asp Gly Ala Asp Val Val Glu Pro Thr Thr Thr Gin Ala Thr Ser Pro 
305 310 315 320 

25 Val Val Leu Ser Glu Ser Asn Leu T la. Pro Leu Thr Asn Ala Ala Ala 

325 330 335 



Pro Gly Leu Pro Glu Val Gly Gly Val Asp Leu Ala Leu Asn Phe Asn 
340 345 350 

Leu Thr Phe Asp Gly Pro Ser Leu Lys Phe Gin lie Asn Gly Val Thr 
355 360 365 



Phe Val Pro Pro Thr Val Pro Val Leu Leu Gin lie Leu Ser Gly Ala 
35 370 375 380 



Gin Ser Ala Ala Asp Leu Leu Pro Ser Gly Ser Val Tyr Ala Leu Pro 
385 390 395 400 

Ser Asn Ala Thr He Glu Leu Ser Leu Pro Ala Gly Ala Leu Gly Gly 
405 410 ■" 415 

Pro His Pro Phe His Leu His Gly His Thr Phe Ser Val Val Arg Pro 
420 425 430 

Ala Gly Ser Thr Thr Tyr Asn Tyr Val Asn Pro Val Gin Arg Asp Val 
435 440 445 



Val Ser He Gly Asn Thr Gly Asp Asn Val Thr He Arg Phe Asp Thr 
50 450 455 460 



Asn Asn Pro Gly Pro Trp Phe Leu His Cys His He Asp Trp His Leu 
■*65 470 475 480 

Glu Ala Ala Leu Pro Leu Ser Ser Leu Arg Thr Ser Leu Thr Leu Arg 
485 490 495 

Pro Leu Thr Leu Ser Pro Arg Thr Gly Pro Thr Cys Ala Leu Ser Thr 
500 505 510 

Thr Leu Trp Thr His Leu He Thr Ser Gly Phe Ala Ser He He Gin 
515 520 525 

65 Trp Met Met Gly Gly Asn Gly Leu Phe Ala Pro His Ala Leu Ser Phe 

530 535 540 

Leu Gly Ser Gin 
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545 

(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

Met Leu Ser Ser lie Thr Leu Leu Pro Leu Leu Ala Ala Val Ser Thr 
15 10 15 



Pro Ala Phe Ala Ala Val Arg Asn Tyr Lys Phe Asp lie Lys Asn Val 
20 20 25 30 

Asn Val Ala Pro Asp Gly Phe Gin Arg Ser lie Val Ser Val Asn Gly 
35 40 45 

25 Leu Val Pro Gly Thr Leu lie Thr Ala Asn Lys Gly Asp Thr Leu Arg 

50 55 60 

lie Asn Val Thr Asn Gin Leu Thr Asp Pro Ser Met Arg Arg Ala Thr 
65 70 75 80 

Thr lie His Tirp His Gly Leu Phe Gin Ala Thr Thr Ala Asp Glu Asp 
85 90 95 

Gly Pro Ala Phe Val Thr Gin Cys Pro lie Ala Gin Asn Leu Ser Tyr 
35 100 105 110 

Thr Tyr Glu lie Pro Leu Arg Gly Gin Thr Gly Thr Met Trp Tyr His 
115 120 125 

40 Ala His Leu Ala Ser Gin Tyr Val Asp Gly Leu Arg Gly Pro Leu Val 

130 135 "140 



lie Tyr Asp Pro Asn Asp Pro His Lys Ser Arg Tyr Asp Val Asp Asp 
145 150 155 160 

Ala Ser Thr Val Val Met Leu Glu Asp Trp Tyr His Thr Pro Ala Pro 

165 170 175 



Val Leu Glu Lys Gin Met Phe Ser Thr Asn Asn Thr Ala Leu Leu Ser 
50 180 185 190 

Pro Val Pro Asp Ser Gly Leu He Asn Gly Lys Gly Arg Tyr Val Gly 
195 200 205 

55 Gly Pro Ala Val Pro Arg Ser Val He Asn Val Lys Arg Gly Lys Arg 

210 215 220 



Tyr Arg Leu Arg Val He Asn Ala Ser Ala He Gly Ser Phe Thr Phe 

225 230 235 240 

Ser He Glu Gly His Ser Leu Thr Val He Glu Ala Asp Gly He Leu 
245 250 255 



His Gin Pro Leu Ala Val Asp Ser Phe Gin He Tyr Ala Gly Gin Arg 
65 260 265 270 

Tyr Ser Val He Val Glu Ala Asn Gin Thr Ala Ala Asn Tyr Trp lie 
275 280 285 
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Arg Ala Pro .Met Thr Val Ala Gly Ala Gly Thr Asn Ala Asn Leu Asp 
290 295 300 

Pro Thr Asn Val Phe Ala Val Leu His Tyr Glu Gly Ala Pro Asn Ala 

305 3L0 315 320 

Glu Pro Thr Thr Glu Gin Gly Ser Ala lie Gly Thr Ala Leu Val Glu 
325 330 335 

Glu Asn Leu His Ala Leu lie Asn Pro Gly Ala Pro Gly Gly Ser Ala 
340 345 350 

Pro Ala Asp Val Ser Leu Asn Leu Ala He Gly Arg Ser Thr Val Asp 

•3CC -I ->y-r- 



15 355 360 



365 



Gly He Leu Arg Phe Thr Phe Asn Asn He Lys Tyr Glu Ala Pro Ser 
370 375 380 

Leu Pro Thr Leu Leu Lys He Leu Ala Asn Asn Ala Ser Asn Asp Ala 
385 390 395 400 

Asp Phe Thr Pro Asn Glu His Thr He Val Leu Pro His Asn Lys Val 
405 410 415 

He Glu Leu Asn He Thr Gly Gly Ala Asp His Pro He His Leu His 
420 425 430 

Gly His Val Phe Asp He Val Lys Ser Leu Gly Gly Thr Pro Asn Tyr 
30 435 440 445 

Val Asn Pro Pro Arg Arg Asp Val Val Arg Val Gly Gly Thr Gly Val 
450 455 460 

Val Leu Arg Phe Lys Thr Asp Asn Pro Gly Pro Trp Phe Val His Cys 
465 470 475 480 

His He Asp Trp His Leu Glu Ala Gly Leu Ala Leu Val Phe Ala Glu 
485 490 495 

Ala Pro Ser Gin He Arg Gin Gly Val Gin Ser" Val Gin Pro Asn Asn 
500 505 510 

Ala Trp Asn Gin Leu Cys Pro Lys Tyr Ala Ala Leu Pro Pro Asp Leu 
45 515 520 525 

Gin 

50 (2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

60 Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

Met Ala Arg Ser Thr Thr Ser Leu Phe Ala Leu Ser Leu Val Ala Ser 
1 5 in 15 



65 



Ala Phe Ala Arg Val Val Asp Tyr Gly Phe Asp Val Ala Asn Gly Ala 
20 25 30 

Val Ala Pro Asp Gly Val Thr Arg Asn Ala Val Leu Val Asn Gly Arg 
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35 40 45 

Phe Pro Gly Pro Leu He Thr Ala Asn Lys Gly Asp Thr Leu Lys He 
^50 55 60 

Thr Val Arg Asn Lys Leu Ser Asp Pro Thr Met Arg Arg Ser Thr Thr 
65 70 75 80 

He His Trp His Gly Leu Leu Gin His Arg Thr Ala Glu Glu Asp Gly 
10 85 90 95 

Pro Ala Phe Val Thr Gin Cys Pro He Pro Pro Gin Glu Ser Tyr Thr 
100 105 110 

15 Tyr Thr Met Pro Leu Gly Glu Gin Thr Gly Thr Tyr Trp Tyr His Ser 

115 120 125 

His Leu Ser Ser Gin Tyr Val Asp Gly Leu Arg Gly Pro He Val He 
130 135 140 

20 

Tyr Asp Pro His Asp Pro Tyr Arg Asn Tyr Tyr Asp Val Asp Asp Glu 
145 150 155 leO 

Arg Thr Val Phe Thr Leu Ala Asp Trp Tyr His Thr Pro Ser Glu Ala 
25 1S5 170 175v- 

Ile He Ala Thr His Asp Val Leu Lys Thr He Pro Asp Ser Gly Thr 
ISO 185 190 



30 



35 



40 



45 



50 



55 



He Asn Gly Lys Gly Lys Tyr Asp Pro Ala Ser Ala Asn Thr Asn Asn 
195 200 205 

Thr Thr Leu Glu Asn Leu Tyr Thr Leu Lys Val Lys Arg Gly Lys Arg 
210 215 220 

Tyr Arg Leu Arg He He Asn Ala Ser Ala He Ala Ser Phe Arg Phe 
225 230 235 240 

Gly Val Gin Gly His Lys Cys Thr He He Glu Ala Asp Gly Val Leu 
245 250 _ 255 

Thr Lys Pro He Glu Val Asp Ala Phe Asp He Leu Ala Gly Gin Arg 
260 265 270 

Tyr Ser Cys He Leu Lys Ala Asp Gin Asp Pro Asp Ser Tyr Trp He 
275. 280 285 

Asn Ala Pro He Thr Asn Val Leu Asn Thr Asn Val Gin Ala Leu Leu 
290 295 300 

Val Tyr Glu Asp Asp Lys Arg Pro Thr His Tyr Pro Trp Lys Pro Phe 
305 310 315 320 

Leu Thr Trp Lys He Ser Asn Glu He He Gin Tyr Trp Gin His Lys 
325 330 335 



His Gly Ser His Gly His Lys Gly Lys Gly His His His Lys Val Arg 

340 345 350 

Ala He Gly Gly Val Ser Gly Leu Ser Ser Arg Val Lys Ser Arg Ala 
60 355 360 365 

Ser Asp Leu Ser Lys Lys Ala Val Glu Leu Ala Ala Ala Leu Val Ala 
370 375 380 

65 Gly Glu Ala Glu Leu Asp Lys Arg Gin Asn Glu Asp Asn Ser Thr He 

385 330 395 400 

Val Leu Asp Glu Thr Lys Leu He Pro Leu Val Gin Pro Gly Ala Pro 
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405 410 415 



Gly Gly Ser Arg Pro Ala Asp Val Val Val Pro Leu Asp Phe Gly Leu 
^ 420 425 430 

Asn Phe Ala Asn Gly Leu Trp Thr lie Asn Asn Val Ser Tyr Ser Pro 
435 440 445 

Pro Asp Val Pro Thr Leu Leu Lys lie Leu Thr Asp Lys Asp Lys Val 
10 450 455 460 

Asp Ala Ser Asp Phe Thr Ala Asp Glu His Thr Tyr lie Leu Pro Lys 
465 470 475 480 

Asn Gin Val Val Glu Leu His lie Lys Gly Gin Ala Leu Gly lie Val 
485 490 495 

His Pro Leu His Leu His Gly His Ala Phe Asp Val Val Gin Phe Gly 
500 505 510 

Asp Asn Ala Pro Asn Tyr Val Asn Pro Pro Arg Arg Asp Val Val Gly 
515 520 525 

Val Thr Asp Ala Gly Val Arg lie Gin Phe Arg Thr Asp Asn Pro Gly 
25 r30 535 540 

Pro Trp Phe Leu His Cys His lie Asp Trp His Leu Glu Glu Gly Phe 
545 550 555 5g0 

Ala Met Val Phe Ala Glu Ala Pro Glu Asp lie Lys Lys Gly Ser Gin 
565 570 575 

Ser Val Lys Pro Asp Gly Gin Trp Lys Lys Leu Cys Glu Lys Tyr Glu 
580 585 590 

Lys Leu Pro Glu Ala Leu Gin 
595 



15 



20 



30 



35 



40 



(2) INFORMATION FOR SEQ ID NO : 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ala Arg Thr Thr Phe Leu Val Ser Val Ser Leu Phe Val Ser Ala 
15 10 15 

55 Val Leu Ala Arg Thr Val Glu Tyr Asn Leu Lys lie Ser Asn Gly Lys 

20 25 30 

lie Ala Pro Asp Gly Val Glu Arg Asp Ala Thr Leu Val Asn Gly Gly 
35 40 45 

60 

Tyr Pro Gly Pro Leu He Phe Ala Asn Lys Gly Asp Thr Leu Lys Val 
50 55 60 

Lys Val Gin Asn Lys Leu Thr Asn Pro Asp Met Tyr Arg Thr Thr Ser 
65 65 70 75 80 

He His Trp His Gly Leu Leu Gin His Arg Asn Ala Asp Asp Asp Gly 
85 90 95 
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Pro Ala Phe Val Thr Gin Cys Pro He Val Pro Gin Ala Ser Tyr Thr 
100 105 110 

Tyr Thr Met Pro Leu Gly Asp Gin Thr Gly Thr Tyr Trp Tyr His Ser 
115 120 125 

His Leu Ser Ser Gin Tyr Val Asp Gly Leu Arg Gly Pro Leu Val lie 
130 135 140 

Tyr Asp Pro Lys Asp Pro His Arg Arg Leu Tyr Asp He Asp Asp Glu 
145 150 155 ISO 



Lys Thr Val Leu He He Gly Asp Trp Tyr His Thr Ser Ser Lys Ala 
15 165 170 175 



He Leu Ala Thr Gly Asn He Thr Leu Gin Gin Pro Asp Ser Ala Thr 
180 185 190 

He Asn Gly Lys Gly Arg Phe Asp Pro Asp Asn Thr Pro Ala Asn Pro 
195 200 205 

Asn Thr Leu Tyr Thr Leu Lys Val Lys Arg Gly Lys Arg Tyr Arg Leu 
210 215 220 

Arg Val He Asn Ser Ser Ala He Ala Ser Phe Arg Met Ser He Gin 
225 230 235 24G 



Gly His Lys Met Thr Val He Ala Ala Asp Gly Val Ser Thr Lys Pro 
30 245 250 255 

Tyr Gin Val Asp Ser Phe Asp He Leu Ala Gly Gin Arg He Asp Ala 
2S0 255 270 

35 Val Val Glu Ala Asn Gin Glu Pro Asp Thr Tyr Trp He Asn Ala Pro 

275 280 285 

Leu Thr Asn Val Ala Asn Lys Thr Ala Gin Ala Leu Leu He Tyr Glu 
290 295 300 

40 

Asp Asp Arg Arg Pro Tyr His Pro Pro Lys Gly Pro Tyr Arg Lys Trp 
305 310 315 320 

Ser Val Ser Glu Ala He He Lys Tyr Trp Lys His Lys His Gly Arg 
'^^ 325 330 335 

Gly Leu Leu Ser Gly His Gly Gly Leu Lys Ala Arg Met Met Glu Gly 
340 345 350 

50 Ser Leu His Leu His Gly Arg Arg Asp He Val Lys Arg Gin Asn Glu 

355 350 365 

Thr Thr Thr Val Val Met Asp Glu Thr Lys Leu Val Pro Leu Glu His 
370 375 380 

Pro Gly Ala Ala Cys Gly Ser Lys Pro Ala Asp Leu Val He Asp Leu 
335 390 395 400 

60 Thr Phe Gly Val Asn Phe Thr Thr Gly His Trp Met He Asn Gly He 

405 410 415 



Pro His Lys Ser Pro Asp Met Pro Thr Leu Leu Lys He Leu Thr Asp 

420 425 430 

Thr Asp Gly Val Thr Glu Ser Asp Phe Thr Gin Pro Glu His Thr He 

435 440 445 
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lie Leu Pro Lys Asn Lys Cys Val Glu Phe Asn lie Lys Gly Asn Ser 
450 455 460 

Gly Leu Gly lie Val His Pro lie His Leu His Gly His Thr Phe Asp 
5 465 470 475 480 

Val Val Gin Phe Gly Asn Asn Pro Pro Asn Tyr Val Asn Pro Pro Arg 

485 490 495 

10 Arg Asp Val Val Gly ,Ala Thr Asp Glu Gly Val Arg Phe Gin Phe Lys 

500 505 510 



15 



35 



40 



55 



Thr Asp Asn Pro Gly Pro Trp Phe Leu His Cys His lie Asp Trp His 

515 520 525 

Leu Glu Glu Gly Phe Ala Met Val Phe Ala Glu Ala Pro Glu Ala lie 
530 535 540 



Lys Gly Gly Pro Lys Ser Val Pro Val Asp Arg Gin Trp Lys Asp Leu 
20 545 550 555 560 

Cys Arg Lys Tyr Gly Ser Leu Pro Ala Gly Phe Leu 
565 570 

25 (2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQtiHNCE CHARACTERISTICS: 

(A) LENGTH: 5 75 -amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

Met Ala Arg Thr Thr Phe Leu Val Ser Val Ser Leu Phe Val Ser Ala 
15 10 15 

Val Leu Ala Arg Thr Val Glu Tyr Gly Leu Lys --lie Ser Asp Gly Glu 
20 25 3 0 



lie Ala Pro Asp Gly Val Lys Arg Asn Ala Thr Leu Val Asn Gly Gly 
45 35 40 45 

Tyr Pro Gly' Pro Leu He Phe Ala Asn Lys Gly Asp Thr Leu Lys Val 
50 55 60 

50 Lys Val Gin Asn Lys Leu Thr Asn Pro Glu Met Tyr Arg Thr Thr Ser 

65 70 75 80 



lie His Trp His Gly Leu Leu Gin His Arg Asn Ala Asp Asp Asp Gly 
85 90 95 • 

Pro Ser Phe Val Thr Gin Cys Pro He Val Pro Arg Glu Ser Tyr Thr 
100 105 110 



Tyr Thr He Pro Leu Asp Asp Gin Thr Gly Thr Tyr Trp Tyr His Ser 

60 115 120 125 

His Leu Ser Ser Gin Tyr Val Asp Gly Leu Arg Gly Pro Leu Val He 

130 135 140 

65 Tyr Pro Lys Asp Pro His Arg Arg Leu Tyr Asp Val Asp Asp Glu Lys 

145 150 155 160 

Thr Val Leu He He Gly Asp Trp Tyr His Glu Ser Ser Lys Ala He 



130 



165 170 175 

Leu Ala Ser Gly Asn lie Thr Arg Gin Arg Pro Val Ser Ala Thr lie 
180 185 190 

Asn Gly Lys Gly Arg Phe Asp Pro Asp Asn Thr Pro Ala Asn Pro Asp 
195 200 205 

Thr Leu Tyr Thr Leu Lys Val Lys Arg Gly Lys Arg Tyr Arg Leu Arg 
210 215 220 

Val He Asn Ser Ser Glu lie Ala Ser Phe Arg Phe Ser Val Glu Gly 
225 230 235 240 

His Lys Val Thr Val lie Ala Ala Asp Gly Val Ser Thr Lys Pro Tyr 
245 250 255 

Gin Val Asp Ala Phe Asp He Leu Ala Gly Gin Arg He Asp Cys Val 
260 265 270 

Val Glu Ala Asn Gin Glu Pro Asp Thr Tyr Trp He Asn Ala Pro Leu 
275 280 285 

Thr Asn Val Pro Asn Lys Thr Ala Gin Ala Leu Leu Val Tyr Glu Glu 
290 295 t 300 

Asp Arg Arg Pro Tyr His Pro Pro Lys Gly Pro Tyr Arg Lys Trp Ser 
305 310 315 320 

Val Ser Glu Ala He He Lys Tyr Trp Asn His Lys His Lys His Gly 
325 330 335 

Arg Gly Leu Leu Ser Gly His Gly Gly Leu Lys Ala Arg Met He Glu 
340 345 350 

Gly Ser His His Leu His Ser Arg Ser Val Val Lys Arg Gin Asn Glu 
355 360 365 

Thr Thr Thr Val Val Met Asp Glu Ser Lys Leu Val Pro Leu Glu Tyr 
370 375 380 

Pro Gly Ala Ala Cys Gly Ser Lys Pro Ala Asp Leu Val Leu Asp Leu 
385 390 395 400 

Thr Phe Gly Leu Asn Phe Ala Thr Gly His Trp Met He Asn Gly He 
405 410 415 

Pro Tyr Glu Ser Pro Lys He Pro Thr Leu Leu Lys He Leu Thr Asp 
420 425 430 

Glu Asp Gly Val Thr Glu Ser Asp Phe Thr Lys Glu Glu His Thr Val 
435 440 445 



He Leu Pro Lys 
450 

Gly He Pro He 
465 

Val Val Gin Phe 



Arg Asp Val Val 
500 

Thr Asp Asn Pro 
515 



Asn Lys Cys He 
455 

Thr His Pro Val 
470 

Gly Asn Asn Pro 
485 

Gly Ser Thr Asp 



Gly Pro Trp Phe 
520 



Glu Phe Asn He 
460 

His Leu His Gly 
475 

Pro Asn Tyr Val 
490 

Ala Gly Val Arg 
505 

Leu His Cys His 



Lys Gly Asn Ser 



His Thr Trp Asp 
480 

Asn Pro Pro Arg 
495 

He Gin Phe Lys 
510 

He Asp Trp His 
525 
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Leu Glu Glu Gly Phe Ala Met Val Phe Ala Glu Ala Pro Glu Ala Val 
530 535 540 

Lys Gly Gly Pro Lys Ser Val Ala Val Asp Ser Gin Trp Glu Gly Leu 
545 550 555 560 

Cys Gly Lys Tyr Asp Asn Trp Leu Lys Ser Asn Pro Gly Gin Leu 
565 570 575 

(2) INFORMATION FOR SEQ ID NO : 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61S amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9; 



Met Lys Arg Phe Phe lie Asn Ser Leu Leu Leu Leu Ala Gly Leu Leu 
25 1 5 10 . c- 15 

Asn Ser Gly Ala Leu Ala Ala Pro Ser Thr His Pro Arg Ser Asn Pro 
20 25 30 

30 Asp lie Leu Leu Glu Arg Asp Asp His Ser Leu Thr Ser Arg Gin Gly 

35 40 45 



Ser Cys His Ser Pro Ser Asn Arg Ala Cys Trp Cys Ser Gly Phe Asp 

50 55 60 

lie Asn Thr Asp Tyr Glu Thr Lys Thr Pro Asn Thr Gly Val Val Arg 
65 70 75 80 



Arg Tyr Thr Phe Asp lie Thr Glu Val Asp Asn Arg Pro Gly Pro Asp 
40 85 90 95 

Gly Val lie Lys Glu Lys Leu Met Leu lie Asn Asp Lys Leu Leu Gly 
100 105 110 

45 Pro Thr Val Phe Ala Asn Trp Gly Asp Thr He Glu Val Thr Val Asn 

115 120 125 



Asn His Leu Arg Thr Asn Gly Thr Ser He His Trp His Gly Leu His 
130 135 140 

Gin Lys Gly Thr Asn Tyr His Asp Gly Ala Asn Gly Val Thr Glu Cys 

145 150 155 ISO 



Pro He Pro Pro Gly Gly Ser Arg Val Tyr Ser Phe Arg Ala Arg Gin 
55 165 ■ 170 175 

Tyr Gly Thr Ser Trp Tyr His Ser His Phe Ser Ala Gin Tyr Gly Asn 
180 185 190 

60 Gly Val Ser Gly Ala lie Gin He Asn Gly Pro Ala Ser Leu Pro Tyr 

195 200 205 



Asp He Asp Leu Gly Val Leu Pro Leu Xaa Asp Trp Tyr Tyr Lys Ser 
210 215 220 

Ala Asp Gin Leu Val He Glu Thr Leu Xaa Lys Gly Asn Ala Pro Phe 

225 230 235 240 
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Ser Asp Asn Val Leu lie Asn Gly Tlar Ala Lys His Pro Thr Thr Gly 
245 250 255 

Glu Gly Glu Tyr Ala lie Val Lys Leu Thr Pro Asp Lys Arg His Arg 
260 255 270 

Leu Arg Leu lie Asn Met Ser Val Glu Asn His Phe Gin Val Ser Leu 
275 280 285 

Ala Lys His Thr Met Thr Val lie Ala Ala Asp Met Val Pro Val Asn 
290 295 300 

Ala Met Thr Val Asp Ser Leu Phe Met Ala Val Gly Gin Arg Tyr Asp 
305 310 315 320 

Val Thr lie Asp Ala Ser Gin Ala Val Gly Asn Tyr Trp Phe Asn lie 
325 330 335 

Thr Phe Gly Gly Gin Gin Lys Cys Gly Phe Ser His Asn Pro Ala Pro 
340 345 350 

Ala Ala lie Phe Arg Tyr Glu Gly Ala Pro Asp Ala Leu Pro Thr Asp 
355 360 365 

Pro Gly Ala Ala Pro Lys Asp His Gin Cys Leu Asp Thr Leu Asp Leu 
370 375 380 

Ser Pro Val Val Gin Lys Asn Val Pro Val Asp Gly Phe Val Lys Glu 
385 390 395 400 

Pro Gly Asn Thr Leu Pro Val Thr Leu His Val Asp Gin Ala Ala Ala 
405 410 415 

Pro His Val Phe Thr Trp Lys lie Asn Gly Ser Ala Ala Asp Val Asp 
420 425 430 

Trp Asp Arg Pro Val Leu Glu Tyr Val Met Asn Asn Asp Leu Ser Ser 
435 440 445 

lie Pro Val Lys Asn Asn lie Val Arg Val Asp Gly Val Asn Glu Trp 
450 455 '^SO 

Thr Tyr Trp Leu Val Glu Asn Asp Pro Glu Gly Arg Leu Ser Leu Pro 
4S5 470 475 430 

His Pro Met His Leu His Gly His Asp Phe Phe Val Leu Gly Arg Ser - 
485 490 495 

Pro Asp Val Ser Pro Asp Ser Glu Thr Arg Phe Val Phe Asp Pro Ala 
500 505 510 

Val Asp Leu Pro Arg Leu Arg Gly His Asn Pro Val Arg Arg Asp Val 
515 520 525 

Thr Met Leu Pro Ala Arg Gly Trp Leu Leu Leu Ala Phe Arg Thr Asp 

530 535 540 

Asn Pro Gly Ala Trp Leu Phe His Cys His lie Ala Xaa His Val Ser 
545 550 555 560 

Gly Gly Leu Ser Val Asp Phe Leu Glu Arg Pro Asp Glu Leu Arg Gly 
565 570 575 

Gin Leu Thr Gly Glu Ser Lys Ala Glu Leu Glu Arg Val Cys Arg Glu 
580 585 590 

Trp Lys Asp Trp Glu Ala Lys Ser Pro His Gly Lys lie Asp Ser Gly 
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595 600 605 



Leu Lys Gin Arg Arg Trp Asp Ala 
510 615 

5 

(2) INFORMATION FOR SEQ ID NO : 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 73 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Gin Gin Ser Cys Asn Thr Pro Ser Asn Arg Ala Cys Trp Thr Asp Gly 
20 1 5 10 15 

Tyr Asp lie Asn Thr Asp Tyr Glu Val Asp Ser Pro Asp Thr Gly Val 
20 25 30 

25 Val Arg Pro Tyr Thr Leu Thr Leu Thr Glu Val Asp Asn Trp Thr Gly 

35 40 45 



Pro Asp Gly Val Val Lys Glu Lys Val Met Leu Val Asn Asn Ser lie 
50 55 50 

lie Gly Pro Thr lie Phe Ala Asp Trp Gly Asp Thr lie Gin Val Thr 
65 70 75 80 



Val lie Asn Asn Leu Glu Thr Asn Gly Thr Ser lie His Trp His Gly 
35 85 90 95 

Leu His Gin Lys Gly Thr Asn Leu His Asp Gly Ala Asn Gly lie Thr 
100 105 110 

40 Glu Cys Pro lie Pro Pro Lys Gly Gly Arg Lys Val Tyr Arg Phe Lys 

115 120 -• 125 



Ala Gin Gin Tyr Gly Thr Ser Trp Tyr His Ser His Phe Ser Ala Gin 
130 135 140 

Tyr Gly Asn Gly Val Val Gly Ala He Gin He Asn Gly Pro Ala Ser 
145 - 150 155 160 



Leu Pro Tyr Asp Thr Asp Leu Gly Val Phe Pro He Ser Asp Tyr Tyr 

50 165 170 175 

Tyr Ser Ser Ala Asp Glu Leu Val Glu Leu Thr Lys Asn Ser Gly Ala 
ISO 185 190 

55 Pro Phe Ser Asp Asn Val Leu Phe Asn Gly Thr Ala Lys His Pro Glu 

195 200 205 



Thr Gly Glu Gly Glu Tyr Ala Asn Val Thr Leu Thr Pro Gly Arg Arg 
210 215 220 

His Arg Leu Arg Leu He Asn Thr Ser Val Glu Asn His Phe Gin Val 
225 230 235 240 



Ser Leu Val Asn His Thr Met Cys He He Ala Ala Asp Met Val Pro 
65 2 4 5 2 5 0 2 5 5 

Val Asn Ala Met Thr Val Asp Ser Leu Phe Leu Gly Val Gly Gin Arg 
260 265 270 



5200-wo 



10 



25 



40 



50 



134 



Tyr Asp Val Val lie Glu Ala Asn Arg Thr Pro Gly Asn Tyr Trp Phe 
275 230 285 

Asn Val Thr Phe Gly Gly Gly Leu Leu Cys Gly Gly Ser Arg Asn Pro 
290 295 300 

Tyr Pro Ala Ala lie Phe His Tyr Ala Gly Ala Pro Gly Gly Pro Pro 
305 310 315 320 

Thr Asp Glu Gly Lys Ala Pro Val Asp His Asn Cys Leu Asp Leu Pro 
325 330 335 



Asn Leu Lys Pro Val Val Ala Arg Asp Val Pro Leu Ser Gly Phe Ala 
15 340 345 350 

Lys Arg Ala Asp Asn Thr Leu Asp Val Thr Leu Asp Thr Thr Gly Thr 
355 360 365 

20 Pro Leu Phe Val Trp Lys Val Asn Gly Ser Ala lie Asn lie Asp Trp 

370 375 380 



Gly Arg Ala Val Val Asp Tyr Val Leu Thr Gin Asn Thr Ser Phe Pro 

385 390 395 400 

Pro 'Gly Tyr Asn lie Val Glu Val Asn Gly Ala Asp Gin Trp Ser Tyr 

405 410 415 



Trp Leu lie Glu Asn Asp Pro Gly Ala Pro Phe Thr Leu Pro His Pro 
30 420 425 430 

Met His Leu His Gly His Asp Phe Tyr Val Leu Gly Arg Ser Pro Asp 
435 440 445 

35 Glu Ser Pro Ala Ser Asn Glu Arg His Val Phe Asp Pro Ala Arg Asp 

450 455 460 



Ala Gly Leu Leu Ser Gly Ala Asn Pro Val Arg Arg Asp Val Ser Met 
465 470 475 480 

Leu Pro Ala Phe Gly Trp Val Val Leu Ser Phe "Arg Ala Asp Asn Pro 
485 490 495 



Gly Ala Trp Leu Phe His Cys His lie Ala Trp His Val Ser Gly Gly 
45 500 505 510 

Leu Gly Val -Val Tyr Leu Glu Arg Ala Asp Asp Leu Arg Gly Ala Val 
515 520 525 



Ser Asp Ala Asp Ala Asp Asp Leu Asp Arg Leu Cys Ala Asp Trp Arg 
530 535 540 



Arg Tyr Trp Pro Thr Asn Pro Tyr Pro Lys Ser Asp Ser Gly Leu Lys 
55 545 550 555 560 

His Arg Trp Val Glu Glu Gly Glu Trp Leu Val Lys Ala 
565 570 



